NSF PAR Search | NSF Public Access Repository

CauchyGCN: Preserving Local Smoothness in Graph Convolutional Networks via a Cauchy-Based Message-Passing Scheme and Clustering Analysis

https://doi.org/10.1007/978-3-031-72344-5_4

Liang, Peiyu; Gao, Hongchang; He, Xubin (January 2024, Springer Nature Switzerland)

Full Text Available
Exploring Memory Access Similarity to Improve Irregular Application Performance for Distributed Hybrid Memory Systems

https://doi.org/10.1109/TPDS.2022.3227544

Liu, Wenjie; He, Xubin; Liu, Qing (March 2023, IEEE Transactions on Parallel and Distributed Systems)

As problem complexity increases, more irregular applications are deployed on high-performance clusters to exploit the parallel working paradigm, and they yield irregular memory access behaviors across nodes. However, the irregularity of memory access behaviors has not been comprehensively studied, which results in low utilization of the integrated hybrid memory system composed of stacked DRAM and off-chip DRAM. To address this problem, we devise a novel method called Similarity-Managed Hybrid Memory System (SM-HMS) to improve hybrid memory system performance by leveraging the memory access similarity among nodes in a cluster. Within SM-HMS, two techniques are proposed: Memory Access Similarity Measuring and Similarity-based Memory Access Behavior Sharing. To quantify the memory access similarity, the memory access behaviors of each node are vectorized, and the distance between two vectors is used as the memory access similarity. The calculated memory access similarity is used to share memory access behaviors precisely across nodes. With the shared memory access behaviors, SM-HMS divides the stacked DRAM into two sections, the sliding window section and the outlier section. The shared memory access behaviors guide replacement in the sliding window section, while the outlier section is managed with an LRU policy. Our evaluation results with a set of irregular applications on various clusters consisting of up to 256 nodes show that SM-HMS outperforms the state-of-the-art approaches Cameo, Chameleon, and Hybrid2 on job finish time reduction by up to 58.6%, 56.7%, and 31.3%, with 46.1%, 41.6%, and 19.3% on average, respectively. SM-HMS can also achieve up to 98.6% (91.9% on average) of the ideal hybrid memory system performance.
Full Text Available
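The similarity measurement described in the SM-HMS abstract above (vectorize each node's memory access behavior, then use the distance between vectors as the similarity) can be illustrated with a minimal sketch. The per-page access-count encoding, the Euclidean metric, and the names `access_vector`, `access_distance`, and `most_similar_node` are assumptions made for illustration; the abstract does not specify the vectorization or the distance used by the paper.

```python
import math


def access_vector(page_accesses, num_pages):
    """Count accesses per page to form a fixed-length vector (hypothetical encoding)."""
    vec = [0] * num_pages
    for page in page_accesses:
        vec[page] += 1
    return vec


def access_distance(vec_a, vec_b):
    """Euclidean distance between two vectors (assumed metric); smaller means more similar."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(vec_a, vec_b)))


def most_similar_node(target, vectors):
    """Return the node, other than `target`, whose vector is closest to the target's."""
    return min(
        (name for name in vectors if name != target),
        key=lambda name: access_distance(vectors[target], vectors[name]),
    )


# Toy access traces for three nodes over four pages (made-up data).
vectors = {
    "n0": access_vector([0, 0, 1, 3], 4),
    "n1": access_vector([0, 1, 3, 3], 4),
    "n2": access_vector([2, 2, 2, 2], 4),
}
nearest = most_similar_node("n0", vectors)
```

In this toy example, the nearest node would be the natural candidate to share access behaviors with; how SM-HMS actually selects sharing partners is not stated in the abstract.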
Zperf: A Statistical Gray-Box Approach to Performance Modeling and Extrapolation for Scientific Lossy Compression

https://doi.org/10.1109/TC.2023.3257517

Wang, Jinzhen; Chen, Qi; Liu, Tong; Liu, Qing; He, Xubin (January 2023, IEEE Transactions on Computers)

Full Text Available
Improving Progressive Retrieval for HPC Scientific Data using Deep Neural Network

https://doi.org/10.1109/ICDE55515.2023.00209

Wang, Jinzhen; Liang, Xin; Whitney, Ben; Chen, Jieyang; Gong, Qian; He, Xubin; Wan, Lipeng; Klasky, Scott; Podhorszki, Norbert; Liu, Qing (April 2023, IEEE)
Locality-based transfer learning on compression autoencoder for efficient scientific data lossy compression

https://doi.org/10.1016/j.jnca.2022.103452

Wang, Nan; Liu, Tong; Wang, Jinzhen; Liu, Qing; Alibhai, Shakeel; He, Xubin (September 2022, Journal of Network and Computer Applications)

Full Text Available
Locality-based transfer learning on compression autoencoder for high-performance lossy compression of scientific data

Wang, Nan; Liu, Tong; Wang, Jinzhen; Liu, Qing; Alibhai, Shakeel; He, Xubin (January 2022, Journal of Network and Computer Applications)

Full Text Available
RBC: A memory architecture for improved performance and energy efficiency

https://doi.org/10.26599/TST.2019.9010077

Liu, Wenjie; Zhou, Ke; Huang, Ping; Yang, Tianming; He, Xubin (June 2021, Tsinghua Science and Technology)

Full Text Available
High-Ratio Lossy Compression: Exploring the Autoencoder to Compress Scientific Data

https://doi.org/10.1109/TBDATA.2021.3066151

Liu, Tong; Wang, Jinzhen; Liu, Qing; Alibhai, Shakeel; Lu, Tao; He, Xubin (January 2021, IEEE Transactions on Big Data)
Full Text Available
Design and Evaluation of a Risk-Aware Failure Identification Scheme for Improved RAS in Erasure-Coded Data Centers

https://doi.org/10.1109/TPDS.2020.3010048

Huang, Weichen; Fang, Juntao; Wan, Shenggang; Xie, Changsheng; He, Xubin (July 2020, IEEE Transactions on Parallel and Distributed Systems)

Data reliability, availability, and serviceability (RAS) of erasure-coded data centers are highly affected by data repair induced by node failures. In a traditional failure identification scheme, all chunks share the same identification time threshold, thus losing opportunities to further improve the RAS. To solve this problem, we propose RAFI, a novel risk-aware failure identification scheme. In RAFI, chunk failures in stripes experiencing different numbers of failed chunks are identified using different time thresholds. For chunks in a high-risk stripe, a shorter identification time is adopted, thus improving the overall data reliability and availability. For chunks in a low-risk stripe, a longer identification time is adopted, thus reducing the repair network traffic. Therefore, RAS can be improved simultaneously. We also propose three optimization techniques to reduce the additional overhead that RAFI imposes on management nodes and to ensure that RAFI can work properly in large-scale clusters. We use simulation, emulation, and a prototype implementation to evaluate RAFI from multiple aspects. Simulation and prototype results demonstrate the effectiveness and correctness of RAFI, and the performance improvement that the optimization techniques bring to RAFI is demonstrated by running the emulator.
Full Text Available
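The core idea in the RAFI abstract above (a shorter identification time for chunks in high-risk stripes, a longer one for chunks in low-risk stripes) can be sketched roughly as follows. The two-level split, the cutoff of two failed chunks, the timeout values, and the function names are all hypothetical choices for illustration; the abstract does not give RAFI's actual thresholds or risk classification.

```python
def identification_timeout(failed_in_stripe, short_s=60.0, long_s=1800.0, high_risk_at=2):
    """Pick a timeout from the stripe's failed-chunk count (all values hypothetical)."""
    # A stripe that already has several failed chunks is closer to data loss,
    # so an unresponsive chunk there is declared failed sooner.
    return short_s if failed_in_stripe >= high_risk_at else long_s


def is_identified_failed(unresponsive_s, failed_in_stripe):
    """Declare a chunk failed once it has been unresponsive past its stripe's timeout."""
    return unresponsive_s >= identification_timeout(failed_in_stripe)


# The same 120-second outage is treated differently depending on stripe risk.
high_risk = is_identified_failed(120.0, failed_in_stripe=2)
low_risk = is_identified_failed(120.0, failed_in_stripe=1)
```

Under these assumed values, the chunk in the high-risk stripe triggers repair immediately, while the chunk in the low-risk stripe is given more time to recover, which is how a longer threshold can avoid unnecessary repair traffic.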
Compression Ratio Modeling and Estimation across Error Bounds for Lossy Compression

https://doi.org/10.1109/TPDS.2019.2938503

Wang, Jinzhen; Liu, Tong; Liu, Qing; He, Xubin; Luo, Huizhang; He, Weiming (July 2020, IEEE Transactions on Parallel and Distributed Systems)

Full Text Available
